FACES: Diversity-Aware Entity Summarization Using Incremental Hierarchical Conceptual Clustering

Authors

  • Kalpa Gunaratna
  • Krishnaprasad Thirunarayan
  • Amit P. Sheth
Abstract

Semantic Web documents that encode facts about entities on the Web have been growing rapidly in size and evolving over time. Creating summaries on lengthy Semantic Web documents for quick identification of the corresponding entity has been of great contemporary interest. In this paper, we explore automatic summarization techniques that characterize and enable identification of an entity and create summaries that are human friendly. Specifically, we highlight the importance of diversified (faceted) summaries by combining three dimensions: diversity, uniqueness, and popularity. Our novel diversity-aware entity summarization approach mimics human conceptual clustering techniques to group facts, and picks representative facts from each group to form concise (i.e., short) and comprehensive (i.e., improved coverage through diversity) summaries. We evaluate our approach against the state-of-the-art techniques and show that our work improves both the quality and the efficiency of entity summarization.
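
As a rough illustration of the idea in the abstract, the sketch below groups facts by facet and picks the best-ranked fact from each group in turn. It is not the FACES implementation: the `Fact` type, the precomputed `cluster` ids, the numeric `uniqueness` and `popularity` scores, and the product used as a ranking score are all assumptions made for this example, and the clustering step itself is taken as given.

```python
from collections import defaultdict
from typing import List, NamedTuple


class Fact(NamedTuple):
    """Hypothetical fact record; every field here is an assumption."""
    prop: str
    value: str
    cluster: int        # facet id, assumed to come from a prior clustering step
    uniqueness: float   # assumed precomputed score
    popularity: float   # assumed precomputed score


def score(fact: Fact) -> float:
    # Assumed ranking: a simple product of the two scores.
    return fact.uniqueness * fact.popularity


def faceted_summary(facts: List[Fact], k: int) -> List[Fact]:
    """Pick up to k facts, taking one per facet before revisiting any facet."""
    groups = defaultdict(list)
    for fact in facts:
        groups[fact.cluster].append(fact)

    # Rank facts inside each facet, then order facets by their best fact.
    ranked = [sorted(g, key=score, reverse=True) for g in groups.values()]
    ranked.sort(key=lambda g: score(g[0]), reverse=True)

    summary: List[Fact] = []
    depth = 0
    # Round-robin over facets: first the top fact of each, then the second, ...
    while len(summary) < k and any(depth < len(g) for g in ranked):
        for g in ranked:
            if depth < len(g) and len(summary) < k:
                summary.append(g[depth])
        depth += 1
    return summary


# Made-up example data, for illustration only.
facts = [
    Fact("birthPlace", "Honolulu", 0, 2.0, 3.0),
    Fact("residence", "Washington", 0, 1.5, 3.0),
    Fact("spouse", "Michelle Obama", 1, 3.0, 1.0),
    Fact("party", "Democratic Party", 2, 1.0, 2.0),
]
print([f.prop for f in faceted_summary(facts, 3)])
# prints ['birthPlace', 'spouse', 'party']
```

With k = 3, the summary takes one fact from each of the three facets, so `residence` is left out even though it outscores `spouse` and `party`. That is the diversity effect the abstract describes.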

Similar resources

Semantics-based Summarization of Entities in Knowledge Graphs

Gunaratna, Kalpa. PhD, Department of Computer Science and Engineering, Wright State University, 2017. Semantics-based Summarization of Entities in Knowledge Graphs. The processing of structured and semi-structured content on the Web has been gaining attention with the rapid progress in the Linking Open Data project and the development of commercial knowledge graphs. Knowledge graphs capture dom...

Human Action Attribute Learning From Video Data Using Low-Rank Representations

Representation of human actions as a sequence of human body movements or action attributes enables the development of models for human activity recognition and summarization. We present an extension of the low-rank representation (LRR) model, termed the clustering-aware structure-constrained low-rank representation (CS-LRR) model, for unsupervised learning of human action attributes from video ...

Hierarchical Summarizing and Evaluating for Web Pages

In this investigation we propose a novel summarization method of Web pages using hierarchical expression. We discuss close relationship between summarization and hierarchical clustering to obtain the results, and we examine how to evaluate hierarchical summarization based on both correlation and structural aspects. We describe some experimental results using NTCIR Web documents to examine our m...

Query Specific ROCK Clustering Algorithm for Text Summarization

The idea of Data Mining has become very popular in recent years. Data Mining is the notion of all methods and techniques, which allow analyzing very large data sets to extract and discover previously unknown structures and relations out of such huge heaps of details. Data clustering is an important technique for exploratory data analysis. Clustering is a data mining (machine learning) technique...

Document Clustering and Summarization Based on Association Rule Mining for Dynamic Environment

Document Summarization is a technique, which reduces the size of the documents and gives the outline and crisp information about the given group of documents. This paper introduces a new update summarization algorithm incorporating association rule mining and correlated concept based hierarchical clustering for dynamic environment. In this algorithm, the associated concepts are extracted using ...


Publication date: 2015